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PROCESSOR WITH SCRIPT-BASED PERFORMANCE MONITORING 
Field of the Invention 

5 The present invention relates generally to data communication devices and systems, and more 

particularly to network processors or other types of processors utilizable in conjunction with 
processing operations, such as routing or switching, performed on packets or other protocol data 
units (PDUs). 

10 Background of the Invention 

A network processor generally controls the flow of packets between a physical transmission 
medium, such as a physical layer portion of, e.g., an asynchronous transfer mode (ATM) network 
or synchronous optical network (SONET), and a switch fabric in a router or other type of packet 
switch. Such routers and switches generally include multiple network processors, e.g., arranged in 
1 5 the form of an array of line or port cards with one or more of the processors associated with each of 
the cards. 

Certain network processors may be configured to support the processing of a variety of 
different types of data traffic, such as ATM cells, Internet Protocol (IP) packets, as well as other 
types of packet-based traffic. The ATM traffic is typically in the form of fixed-length 53-byte cells, 
20 each including a 5 -byte header and a 48-byte payload, while the IP traffic generally comprises 
variable-length packets. ATM cells and IP packets may be viewed as examples of what are more 
generally referred to herein as PDUs. 

Operation, administration and maintenance (OAM) functions within the network processor 
may be provided in accordance with an established protocol, such as ITU-T Recommendation 1.610, 
25 "B-ISDN Operation and Maintenance Principles and Functions," February 1999, which is 
incorporated by reference herein. 

One type of OAM function relates to performance monitoring (PM). OAM performance 
monitoring in the ATM context generally involves in-service monitoring of cell traffic on an ATM 
connection to detect errors and to collect statistical information regarding network reliability. OAM 
30 performance monitoring is initiated and released by respective activation and deactivation sequences. 
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Once initiated, one or more OAM forward performance monitoring (FPM) cells are inserted into a 
stream of user cells on a given connection. The network processor which receives an FPM cell must 
calculate errors and statistics for a corresponding block of user cells. Results are sent back to the 
originator of the OAM PM flow as a backwards reporting (BR) cell. 
5 A problem that arises in utilizing OAM performance monitoring in a network processor is 

that such monitoring is difficult to implement in a manner that provides sufficient flexibility for 
statistical calculation without degrading overall system performance. Typical conventional 
approaches involve implementing OAM performance monitoring either entirely in the form of one 
or more hardware state machines, or entirely in software executed by a host processor associated 

1 0 with the network processor. 

The hardware state machine approach is fast but not very flexible. For example, it fails to 
provide a sufficient level of programmability regarding the particular statistics to be collected, and 
usually offloads the statistics to a host processor for later collection. 

The host processor software approach is very flexible, but may adversely impact overall 

15 system performance since the software must update all the statistics and then generate the 
appropriate BR cells. 

It is therefore apparent that a need exists for an improved technique for implementing OAM 
performance monitoring in a network processor. 

20 Summary of the Invention 

The invention provides improved performance monitoring for processing received cells, 
packets or other PDUs in a network processor or other type of processor, in a manner that avoids the 
difficulties associated with the conventional approaches described above. 

In accordance with one aspect of the invention, a network processor or other type of 
25 processor includes controller circuitry operative to control performance monitoring for a particular 
flow of PDUs received by the processor. The controller circuitry includes a classifier and is 
operative to access memory circuitry associated with the processor. The classifier is configured to 
perform at least a first pass classification of at least a subset of the PDUs. The controller circuitry 



2 



Brown 7-3-1 

in conjunction with a first pass classification of a PDU of a first type is operative to execute a first 
script, and in conjunction with a first pass classification of a PDU of a second type is operative to 
execute a second script different than the first script. The PDUs of the first and second types may 
comprise, by way of example, ATM user cells and OAM FPM cells, respectively. A result of 
5 execution of at least one of the first and second scripts is stored in the memory circuitry. A 
performance monitoring output is generated, responsive to receipt of the protocol data unit of the 
second type, based at least in part on the result of execution of at least one of the first and second 
scripts. 

The performance monitoring output may be generated in conjunction with a second pass 
1 0 classification of the PDU of the second type, upon execution of an additional function or other type 
of script. This additional execution may cause the retrieval of a result of execution of the second 
script from the memory circuitry, with the performance monitoring output being generated based at 
least in part on the result of execution of the second script. 

The controller circuitry may comprise, in addition to the classifier, a compute engine for 
15 executing one or more of the scripts, and a traffic manager for generating the performance 
monitoring output. The compute engine may be implemented as a portion of a state engine of the 
processor, or using other suitable arrangements of hardware, software or firmware. 

In the illustrative embodiment, the performance monitoring output comprises one or more 
backwards reporting (BR) cells generated in a manner compliant with OAM performance monitoring 
20 as specified in the above-cited 1.610 protocol. 

The processor may be configured as a network processor integrated circuit to provide an 
interface between a network and a switch fabric in a router or switch. 

Advantageously, the techniques of the invention in the illustrative embodiment process 
received PDUs in a manner that provides a high degree of flexibility in terms of statistics collection 
25 without undermining system performance. 
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Brief Description of the Drawing s 

FIG. 1 is a simplified block diagram of an information processing system in which the 
present invention is implemented. 

FIG. 2 shows a more detailed view of a portion of the FIG. 1 system illustrating the 
5 implementation of performance monitoring therein in accordance with the invention. 

FIG. 3 is a flow diagram of performance monitoring operations implemented in the FIG. 1 
system in accordance with the techniques of the invention. 

FIG. 4 illustrates one possible implementation of a network processor of the FIG. 1 system 
as an integrated circuit installed on a line card of a router or switch. 

10 

Detailed Description of the Invention 

The invention will be illustrated herein in conjunction with an exemplary information 
processing system which includes a network processor configured in a particular manner. It should 
be understood, however, that the invention is more generally applicable to any processor in which 
15 it is desirable to provide more efficient performance monitoring than is possible using the 
conventional techniques described above. 

A "processor" as the term is used herein may be implemented, by way of example and 
without limitation, utilizing a microprocessor, central processing unit (CPU), digital signal processor 
(DSP), application-specific integrated circuit (ASIC), or other type of data processing device, as well 
20 as portions or combinations of these and other devices. 

The term "controller circuitry" as used herein is intended to include any portion of a 
processor or other type of logic or processing circuitry which is capable of performing or otherwise 
controlling at least a portion of an operation associated with performance monitoring in the 
processor. 

25 The term "memory circuitry" as used herein is intended to include a separate memory as well 

as a particular portion of a larger memory, and may refer to internal memory, external memory or 
combinations thereof. 
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The terms "protocol data unit" and "PDU" as used herein are intended to include a cell, a 
packet, or other identifiable grouping of information. 

The present invention in an illustrative embodiment is configured such that a network 
processor includes controller circuitry configurable to control processing operations associated with 
5 performance monitoring for one or more flows received by the network processor. The processing 
operations are preferably implemented as script-based operations within a compute engine of the 
controller circuitry. 

FIG. 1 shows an information processing system 100 in which the invention is implemented. 
The system 100 includes a network processor 102 having an internal memory 104. The network 

10 processor 102 is coupled to an external memory 106 as shown, and is configured to provide an 
interface between a network 108 from which PDUs are received and a switch fabric 110 which 
controls switching of PDU data. The processor 102 and its associated external memory 106 may be 
implemented, e.g., as one or more integrated circuits installed on a line card or port card of a router 
or switch. In such a configuration, the switch fabric 1 10 is generally considered to be a part of the 

15 router or switch. 

One or both of the internal and external memories 104, 106 may be utilized to implement 
otherwise conventional network processor memory elements such as PDU buffer memory, queuing 
and dispatch buffer memory, etc. 

Also coupled to the network processor 102 is an associated host processor 1 12. The host 

20 processor 1 12 may be configured to communicate with the network processor 102 over a standard 
bus architecture, e.g., a peripheral component interconnect (PCI) bus. 

In accordance with an aspect of the invention, the network processor 1 02 includes controller 
circuitry comprising a classifier 1 14, a traffic manager 116 and a state engine 118. These elements 
are configured to perform operations associated with performance monitoring, as will be described 

25 in greater detail in conjunction with FIGS. 2 and 3 below. It should be appreciated, however, that 
this particular arrangement of controller circuitry elements is by way of illustrative example only, 
and other types and arrangements of elements may be used in alternative embodiments of the 
invention. 
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The classifier 1 14 is preferably configured to perform a two-pass classification or other type 
of multiple-pass classification of PDUs received by the network processor 102. Examples of 
multiple-pass classification techniques suitable for use with the present invention may be found in 
U.S. Patent Application Serial No. 10/029,703, filed December 21, 2001 and entitled "Processor 
5 with Multiple-Pass Non- Sequential Packet Classification Feature," which is commonly assigned 
herewith and incorporated by reference herein. However, the invention does not require the use of 
multiple-pass classification. The term "first pass classification" as used herein is therefore intended 
to include not only a first pass of a multiple-pass classification, but also the only pass of a single-pass 
classification. 

10 It will be assumed without limitation that, in the illustrative embodiment, the performance 

monitoring functionality of the network processor 102 may be provided in a manner that conforms 
to an established protocol, such as the above-noted ITU-T Recommendation 1.610 for OAM 
functions. However, the invention does not require the use of this particular protocol, or any other 
standard protocol, and can be configured using non-standard OAM techniques as well as 

1 5 combinations of standard and non-standard OAM techniques. 

The particular arrangement of system elements shown in FIG. 1 is by way of illustrative 
example only. More particularly, as previously noted, the invention can be implemented in any type 
of processor, and is not limited to any particular PDU processing application. Also, the system 100 
and network processor 102 as illustrated in FIG. 1 are considerably simplified for clarity of 

20 discussion, and may include other elements in addition to or in place of those specifically shown, 
including one or more elements of a type commonly found in a conventional implementation of such 
a system and network processor. For example, the network processor may include queuing and 
dispatch logic, one or more memory controllers, interface circuitry for interfacing the network 
processor with the network 108, the switch fabric 110, the host processor 112 and other external 

25 devices, as well as other conventional elements not explicitly shown in the figure. These 
conventional elements, being well-understood by those skilled in the art, are not described in detail 
herein. 
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The performance monitoring functionality of the network processor 1 02 as described herein 
may be implemented at least in part in the form of software program code. For example, the 
controller circuitry comprising classifier 114, traffic manager 116 and state engine 118 may be 
implemented at least in part utilizing elements that are programmable via instructions or other 
5 software that may be supplied to the network processor 102 via the host processor 112 or other 
suitable mechanism. 

FIG. 2 shows a more detailed view of a portion 200 of the system 100, illustrating the 
operation of the classifier 1 14, traffic manager 1 16 and state engine 1 18 of the network processor 
102 in providing performance monitoring functionality within the system 100. It should be noted 

10 that the elements 1 14, 1 16 and 118 may perform other operations within the network processor, of 
a type conventionally performed by such elements but not otherwise associated with the performance 
monitoring operations illustrated in FIG. 2, and that these other operations may be implemented in 
a manner known to those skilled in the art. 

The classifier 1 14 in the illustrative embodiment will be assumed to be a two-pass classifier, 

15 as was previously indicated, although other types of classifiers can be used. 

The state engine 118 as implemented in the network processor 102 includes a compute 
engine 202. The compute engine controls the storage of per-flow OAM performance monitoring 
statistic information in a storage element 204, and also accesses an FPM return circular first-in-first- 
out (FIFO) storage element 206. The storage elements 204, 206 are preferably implemented as 

20 portions of the external memory 1 06, but one or both may be implemented as portions of the internal 
memory 104, or using combinations of internal and external memory. 

Although the compute engine 202 is shown in FIG. 2 as being implemented as a portion of 
a state engine of the network processor 1 02, other suitable arrangements of one or more of hardware, 
software and firmware may be utilized in implementing the compute engine. Also, other types of 

25 execution mechanisms suitable for executing scripts may be used in conjunction with or in place of 
compute engine 202. 

The performance monitoring functionality of the network processor 1 02 as illustrated in FIG. 
2 generally includes operations denoted as steps (1) through (5) in the figure. 
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Certain of these steps involve the use of functions, such as functions denoted herein as 
OAM_COUNT, OAMFPM and OAM_FPM_RETURN, which are executable by the compute 
engine 202 or otherwise executable by the network processor 102. Such functions are examples of 
what are more generally referred to herein as "scripts." The term "script" as used herein is intended 
5 to include without limitation any function or other type of program code executable in a network 
processor or other type of processor. 

Step (1), which is not explicitly shown in FIG. 2, involves activation of the performance 
monitoring functionality of the network processor 102 for a given PDU flow or set of PDU flows. 
This activation may be controlled via activation sequence software running on the host processor 

10 112. The activation process in the illustrative embodiment may specify particular processing scripts 
for use by the compute engine, the number of user cells between FPM cells, and other information. 
A PDU flow for which OAM performance monitoring has been activated is also referred to herein 
as an OAM flow or an OAM PM flow. The term "user cell" in this context is intended to include 
any type of cell other than an FPM cell. 

1 5 In operation, user cells and FPM cells are received on a given OAM PM flow by the classifier 

1 14, and in a first pass of the classifier corresponding functions OAMCOUNT and OAMFPM are 
called for each of the respective user cells and FPM cells. More specifically, each user cell of the 
OAM flow when received in the classifier results in a call of the OAM_COUNT function in 
conjunction with first pass classification of that user cell, and each FPM cell of the OAM flow when 

20 received in the classifier results in a call of the OAM_FPM function in conjunction with first pass 
classification of that FPM cell. 

The OAM_COUNT function may be configured, by way of example, to count the number 
of cells, accumulate parity over a block of user cells, and store the information back to memory. 
The OAM_FPM function may be configured, by way of example, to take data from an FPM 

25 cell and results generated by the OAMCOUNT function calls, and to calculate a corresponding BR 
result. The BR result may indicate how many user cells were received and if any errors occurred, 
and may also be used to generate other statistics. 
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It should be noted that these and other functions described herein are merely examples of 
processing scripts that may be used in implementing the invention. A wide variety of other scripts 
may be used, as will be appreciated by those skilled in the art. 

The above-noted function calls are passed from the classifier to the compute engine of the 
5 state engine. This portion of the process corresponds generally to Step (2) in the figure. The figure 
also shows a single FPM cell 210 arriving at an input of the classifier. 

As noted previously herein, FPM cells are interspersed with user cells in OAM performance 
monitoring, and receipt of an FPM cell triggers the calculation of errors and statistics for a 
corresponding block of user cells. Results are sent back along the connection as a backwards 
10 reporting (BR) cell. 

The compute engine 202 upon receipt of the OAM_FPM function call associated with a 
given FPM cell executes the function, which calculates errors and statistics over the appropriate 
block of user cells and then writes a BR result to the head of the circular FIFO 206. The per-flow 
OAM PM statistic information is written to the storage element 204. This portion of the 
15 performance monitoring process is indicated generally as Step (3) in the figure. 

Second pass classification of the given FPM cell in the classifier 1 14 results in a call of an 
OAM_FPM_RETURN function. Execution of the function causes the BR result to be retrieved or 
"popped" from the tail of the circular FIFO 206, and returned to the classifier 1 14 for delivery to the 
traffic manager 116 along with the FPM cell. This portion of the process corresponds generally to 
20 Step (4) in the figure. 

The OAMFPMRETURN function in the illustrative embodiment is implemented as a 
function that is not executed by the compute engine 202, but is instead executed elsewhere in the 
network processor 102. In other embodiments, the OAMFPMRETURN function could be 
implemented as a function or other type of script executed by the compute engine. 
25 The traffic manager receives the FPM cell and the BR result and formats a corresponding BR 

cell 212, which may be delivered to the switch fabric 1 10, for example, so that it may be delivered 
back to the originator of the OAM PM flow. This portion of the process corresponds generally to 
Step (5) in the figure. 
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FIG. 3 is a flow diagram 300 which summarizes the processing operations associated with 
performance monitoring in the network processor 102 in the illustrative embodiment of the 
invention. Again, the performance monitoring is assumed to be implemented in a manner compliant 
with the above-cited 1.610 OAM protocol. 
5 In step 302 , the performance monitoring functionality is activated. 

In step 304, the OAM_COUNT function is called for each user cell received on a given OAM 
PM flow, in conjunction with first pass classification of the user cell in the classifier 1 14. 

In step 306, the OAM FPM function is called for an FPM cell received on the OAM PM 
flow, in conjunction with first pass classification of the FPM cell in the classifier 114. As noted 
1 0 above, this function calculates errors and statistics over a block of user cells and writes a BR result 
to circular FIFO 206, preferably in external memory 106. 

In step 308, the OAMFPMRETURN function is called in conjunction with second pass 
classification of the FPM cell in the classifier 114. As noted above, this function when executed 
retrieves the BR result from the circular FIFO 206. The BR result is returned to the traffic manager 
15 116 along with the FPM cell, via the classifier 1 14. 

In step 3 1 0, the traffic manager formats the original FPM cell and the BR result into an OAM 
BR cell which may be transmitted to the switch fabric 1 10. 

The particular steps shown in FIG. 3 should be considered as examples of performance 
monitoring processing operations in accordance with the illustrative embodiment of FIGS. 1 and 2, 
20 rather than as limiting the scope of the invention in any way. For example, it is possible to 
implement the invention utilizing more scripts or fewer scripts than the particular number used in 
FIGS. 2 and 3. 

An advantage of the performance monitoring implementation in the illustrative embodiment 
described above is that it provides improved flexibility in the statistics collected for a given OAM 
25 PM flow. Since the OAM statistics and BR cell information are calculated on the first pass of ATM 
cell processing, where ATM cells are typically reassembled into larger PDUs, the BR result can be 
written into a FIFO that can be read later while the reassembled PDU is "replayed" to the traffic 
manager. 

10 
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This decoupling of first pass and second pass processing in the illustrative embodiment 
decreases the system performance impact of generating OAM BR cells, while also providing 
enhanced flexibility in performance monitoring. For example, the invention can provide full 
programmability of the particular OAM statistics to be collected, while also ensuring that the 
5 network processor experiences no significant degradation in performance when OAM performance 
monitoring is running with normal traffic. 

The network processor 1 02 may be configured as an integrated circuit to provide an interface 
between a network and a switch fabric in a router or switch. 

FIG. 4 illustrates an example router or switch line card embodiment of a portion of the 
10 system 100 of FIG. 1. In this embodiment, the processing system comprises a line card 400 having 
at least one integrated circuit 402 installed thereon. The integrated circuit 402 comprises network 
processor 102 which has internal memory 104. The network processor 102 interacts with external 
memory 1 06 on the line card 400. The host processor 1 1 2 may also be installed on the line card 400, 
although it is not shown in the figure. 
1 5 The portion of the processing system as shown in FIG. 4 is considerably simplified for clarity 

of illustration. It is to be appreciated, however, that the processing system may comprise a router 
or switch which includes multiple such line cards, and that each of the line cards may include 
multiple integrated circuits. A similar embodiment may be implemented in the form of a port card 
in a router or switch. 

20 The above-described embodiments of the invention are intended to be illustrative only. For 

example, although the illustrative embodiment of FIG. 1 utilizes classifier 1 14, traffic manager 1 16 
and state engine 1 1 8 in implementing processing operations associated with performance monitoring, 
other embodiments may utilize different types of controller circuitry for implementing such 
operations. In addition, other embodiments can use different types of functions or other scripts, and 

25 other execution mechanisms, for implementing the described functionality. A performance 
monitoring output may be generated based on a result of execution of a single script or results of 
execution of multiple scripts. Also, the invention does not require the use of any particular 
performance monitoring protocol or results reporting format. These and numerous other alternative 
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embodiments within the scope of the following claims will be readily apparent to those skilled in the 
art. 
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